extract highlighted text from pdf